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An Extension of the Two-Parameter Logistic Model 
to the Multidimensional Latent Space 



Item response theory CIRT) has proven to be a very powerful and useful 
measurement tool. However, most of the IRT models that have been proposed, 
and all of the models commonly used, require the assumption of unidimensionality, 
which prevents their application to a wide range of tests. The few models 
that have been proposed for use with multidimensional data have not been 
developed to the point that they can be applied in actual testing situations. 
The purpose of this report is to present a model for use with multidimensional 
data and to discuss some of its characteristics. This discussion will 
include information on the interpretation of the model parameters^ the 
sufficient statistics for the model parameters, and the information function 
for the model. In addition, a procedure for estimating the parameters of 
the model will be discussed. 



The Modftl and Its Characteristics 



The Model 



The model proposed in this report is a multidimensional extension of 
the two-parameter logistic model. The two-parameter logistic (2PL) model, 
proposed by Birnbaum (1968), is given by 

exp(Da^(ej - b^)) 

where a. is the discrimination parameter for item i, b. is the difficulty 
parameter for item i, 9. is the ability parameter for examinee j\ r^(6j) is 

the probability of a correct response to item i by examinee j, and D = 1.7. 
The multidimensional extension of the 2PL model (M2PL), as presented by 
McKinley and Reckase (1982), is given by 

exp (d + a , £ , ) 

where a^ is a row vector of discrimination parameters for item i, 9^ is a 
column vector of ability parameters for examinee j, P^t^j' the probability 
of a correct response to item i by examinee j, and d^^ is given by 

where ^i.j^ is the discrimination parameter for item i on dimension k, b^j^ is 

the difficulty parameter for item i on dimension k, and m is the number of 
dimensions being modeled* The d. term, then, is velated to item difficulty. 
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but is not e difficulty parairtter in the same sense as the b, parameter is 
in the unidimensional model. 

Interpretation of the Model Parameters 

The interpretation of the parameters of unidimensional IRT models is 
closely tied to the itep characteristic curve (the regression of item score 
on ability). The item difficulty parameter is defined as the point on the 
ability scale wtiere the point of inflection of the item characteristic 
curve (ICC) occurs. This is equivalent to saying the item difficulty value 
is the point on the ability scfde where the second derivative of the ICC 
function is equal to zero. For the 2PL model, the second derivative is 
given by 



where P is the probability of a correct response to item i given ability j# 
Q = 1-P, and and D are as previously defined. Setting the right hand 

side of (4) equal to zero yields a solution of P = Q = 0.5. Of course, 

P = 1.0 and P = 0.0 are also solutions, bat these represent degenerate 

cases where 6 =i +<» and 6 = respectively. Thus, the point of inflection 

occurs at P = 0.5, which occurs where b. = 0.. The difficulty of an item 

1 J 

for the 2PL model, then, is the point on the ability scale which yielcis a 
probability of a correct response equal to 0.5. Figure 1 shows a typical 
ICC for the 2PL model. The dotted line shows the relationship among the 
item difficulty value, ability, and the probability of a correct response. 



= D2a^2pQ(i _ 2P) , 




(A) 



Figure 1 



l.Oi 



A Typical ICC for the 2PL Model 
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The item discrimination parameter is related to the slope of the ICC 
at the point of inflection. The slope of the ICC at the point of inflection 
is found by taking the first derivative of the ICC and evaluating it at the 
point of inflection. For the 2PL model the first derivative is given by 



||-=Da^PQ, (5) 

where all the terms are as previously defined. It was previously fo»ind 
that the pdint of inflection for the 2PL model occurs where P = 0-5. 
Substituting 0.5 into (5) yields a slope at the point of inflection of 
Da./4. 

Difficulty and discrimination are defined somewhat differently for 
multidimensional models. To begin with, the response function (the model) 
defines a multidimensional item response surface (iRS) rather than a curve. 
This surface may have many points of inflection, and the points of inflection 
may vary depending on the direction relative to the 6*axes. Because of this, 
the item parameters for the M2PL model arc <lefined in terms of directional 
derivatives (Kaplan, 1952)- 

For multidimensional models, difficulty is defined as the locus of 
points of inflection of the IRS for a particular direction. This is found 
by taking the second directional derivative of the response function, 
setting it ^qual to zero, and solving for the 6-vector. The second directional 
derivative for the M2PL model is given by 



V/P 



2d = 



<S^P 



62p 



6^P 



* *- *^ m 



_6fP_ 



COS4>2 COS'tl + 



6^P 

costbo cos<fr 



(6) 



6e^ 



whei'e j> represents the vector of angles with respect to each of the m axes. 
Solving the derivatives in (6) and simplifying yields 

V ''P = PQ(1 - 2P) (aicos(t»i + a2COS(t»2 + . . . + a^cos<(>^) , (7) 
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Setting (7) equal to zero and solving yields three solutions--? " 0.0, 
P = 0.5, and P = 1.0. The solutions 0.0 and 1.0 represent degenerate cases 
where 6 = ±»* P = 0.5 occurs when the exponent of the H2PL model is zero. 
That is P " 0*5 when 



d + ai^i + ^2^2 + * * * + a 0 = 0. (8) 

m m 

In the two-dimensioaal case this is the equation for a liae* 

For the M2PL model, as caa be seea from the above derivations, the 
direction, ij, falls out of the equations* Item difficulty for the M2PL model 
is the sarae for all directions of travel* This is not necessarily the case 
for all multidimensional models. 

Figure 2 shows a typical response surface for the M2PL model in the 
two-diraensional case. The dotted line indicates the line of difficulty* 
In the m-dimensional case (8) is the equation for a hypeq^laae. 



Figure 2 

A Typical Response Surface for the M2PL Model 
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For multidimensional models^ item discrimination is a function of the 
slope of the IRS at the locus of points of inflection in a particular 
direction. This is obtained by taking the first directional derivative of 
the response function and evaluating it at the locus of points of inflection. 
For the H2PL model the first directional derivative is given by 

V ^ ^ ^ ^ ' 

where ^ represents the vector of angles of the direction in the d-space 
vith respect to each of the m axes^ For the two-dimensional case (9) is 
given by 

y = aiFQ cos ^ + a^PQ sin ^ , (10) 

where ^ is the angle with the 6t axis. When ^ ^ 0^ (direction parallel to 
6| axis) the slope is aiPQ» and when ^ ^ 90^ (parallel to 62 axis) the slope 
is a2PQ- In general, when the direction is parallel to the 6^ axis, the slope 

is 3^PQ- Since P - Q = 0.5 at the line of inflection, the slope parallel to 

the 9 axis at those points is a /4. In the unidimensional case 4 = 0^, and 
m m 

the slope of the ICC at the point of inflection is Da/4+ 

Sufficient Statistics 

Definition Assume that there exists some distribution that is of 
known form except for some unknown parameter 6, and that x represents a set 
of observations from that distribution. Also assume that S(x) is some 
statistic which is a function of x. If S(x) is a sufficient statistic for 
3, then it must be possible to factor the probability function of x, P(x)d) 
into the form: 

Pfx|6) ^ f(S(x)l8lg(x). (U) 

In this form it is easy to see that g(x) is independent of 9, and so provides 
no information about 6. Selection of 6 to maximise the probability of x is 
tantamount to selecting 8 to maximize the probabilty of S(x). 

In item response theory x is typically a response strings either by 
one examinee to a set of items 01* by a set of examinees to a single item. 
In this case, P(x)9) is the likelihood of the response string. For the 
M2PL model, the likelihood of an examinee's response string is given by 

P(x.l9.) ^ n P(x..|9.) (12) 
J J i~l 

where x^^ is the response to item i by examinee 6^ is the vector of 

abilities for examinee j * x^ is the response string for examinee and n 

is the number of items. The likelihood of the set of responses to an item 
is given by: 
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(13) 



where P(x^j]d^j a^) is the probability of response x^j for item i, and 

a^ are the item parameters for item i, is the vector of responses to 

item i) and N is the number of examinees. In order for any statistic to be 
a sufficient statistic for a parameter of the M2PL model^ it must be possible 
to factor the appropriate likelihood function into the form given by Cll). 

Sufficient Statistic for the Ability Parameter For the M2PI model 
(12) can be factored into the form: 

n n n 

pCx, li J = n Q.(e . )exp(e i: a . x )exp( i: d x ). (^^x 
-j j 1=1 ^ J J i=l ^ i=l ^ 

From (14) it can be seen that 



(15) 
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is a vectof of sufficient statistics for 9,. (For a discussion of the 

J < 

derivation of the suffxcient statistic for ability in the unidimensional 
case* see Lord and Novick^ 1968* chapter 16). 

Sufficient Statistics for the Item Parameters For the item parameters 
of the M2PL model » (13) can be factored into the form: 

. N N N 

P(Xi!d^.a^) = n Q (d^,a )exp(a 2 i , x )exp(d i: x^ J . (i6) 

j=i j=>i J J j=i -^j 

From Cl6) it can be seen that 

N 

is a sufficient statistic for the d-parameter, and 

s^(x^) = J^ijX^j (18) 
is a vector of sufficient statistics for the a-paramcter. 

Information Function 

Definition In item response theory the precision of estimates based 
on a given scoring formula are generally described in terms of the information 
function of the scoring formula. The information function of a particular 
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scoring formula, as given by lord and Kovick (1968), is given by 



I [o,sCx)] = 



a2 [}cx),e] 



1 



IgE [scx)le] 



(19) 



where sCx) represents a given scoring formula for the model of interest, 
o^(s(X),9] is the variance of the scoring formula, and the derivative 
3E[sCX)|8l/38 specifies how the mean of the scoring fonwila changes as 8 
changes ^ 

If sCx) takes the form 



sCx) = Z w^x^ , 



i-1 



(20) 



where w. is a positive number, then the expected value E[s(X)|8] is given 
by 



E [s(X)|e1 = z w.p.ce), 
J 1=1 ^ ^ 



(21) 



and the variance of the scoring formula is given by 



[scx),e] = J^w^2p^(e)Q^ce). 



(22) 



(For a discussion of these derivations, see Lord and Hovick, 1968)* 
Substituting (2l) and (22) into (19) yields 



I [e,sCx)3 = 



z w 2p (e)Q ce) 
1=1 1 1 1 



-1 



"w^P^-CO) 
i=l 



where P. '(8) =3P.(0)/38. For a single item (23) takes the form 



(23) 



I [e,sCx)] = p^'Ce)Vp^Ce)Q^ce) , 



(24) 



which is the item information function. If (24) is written in terms of the 
response x,, rathet than the scoring formula s(x), the same result is 
obtained. ^That is, 1(8, x.) = 1(6, s(x.)]. lord and Novick (1968) have 
shown that, unless s(x) represents the locally best weights at 8» 
I[8,s(x)] < 1 1(6, X.). That is, the sum of the item information functions, 
which is independent of the the scoring formula, represents an upper bound 
on each and all information functions obtained using different scoring 
formulas. The suia of the item information functions is called the test 
information function, and is given ly 
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n n 

1(e) = i: i(e,x ) = i: p'(6)^/p (6)q (e). (25) 
1=1 ^1=1 ^ ^ 

Infonnation Functions for the M2PL Model For the unidimensional 2Pl 
model, given by (1), the item infomttion function is given by 

I(6*x^) = D^a^^P^(0)Q^(6) . (26) 

Test information for the unidimensional 2PL model is given by 

n 

1(e) - i: D^a.^p.(e)Q,(e). (27) 

As was the case for discrimination^ information for the M2PL model 
varies depending on the direction relative to the 9-axes* Therefore, 
item and test infonnation for the H2PL model are defined using the first 
directional derivative of the response function, which is given by (9)^ 
Item information for the M2PL oodel is given by 



1(6, x^) = ai^ PQ cos^^i + PQ cos^^2 + * . * + a ^ PQ cos^^ + 
*■ 1 * *■ m m 

2aia2 PQ cos^i cos<Jp + , + + + 2ata PQ cos<Ji cos<J + 



(28) 



For the two dimensional case, this simplifies to 



I(e_,x^) = l'Q(aicos^ + a2Sin^) * 



(29) 
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Note that when the direction of travel is parallel to the 9*axis (<J> = 0**)> 
item information is given by a^^PQ. When only 62 is of interest (<J> = 90^) > 
item information is given by a2^PQ* If the two dimensions are weighted equally 
((> = 45*^), item information is given by 0.5(ai^PQ + 2ai a2PQ + a2^PQ)- Figures 
3> 4^ and 5 show typical item information surfaces for ^ = 0*^, 45^, and 90** 
respectively. Note that these are not the same surface seen from different 
angles. They are different surfaces^ all for the same item^ obtained by 
varying the direction with respect to the 9-axes+ As can be seen^ they are 
quite different. Test information for the M2PL model is simply the sum of 
(29) over all of the items. Figures 6, 7, and 8 show typical test information 
surfaces for ^ = 0^, 45**, and 90**, respectively* Again^ the three surfaces 
are quite different, indicating that the test gives different amounts of 
information that are concentrated at different places in the 9-space when 
different weighted composites of ability are of interest* 



Maximum Likelihood Estimatic^n 

Maximum likelihood estimation of the parameters of the M2PL nodel is 
relatively straightforward* The likelihood of a response matrix for the 
H2PL model (or for any latert trait model) is given by 

n N 

L - n n P(x_) * (30) 

where all the terms have been previously defined* For an examinee's response 
string, the likelihood is given by (12), and the likelihood of a response 
string for an item is given by (13)* The first derivative of tfie log of 
the likelihood given in (12) is given by: 

5Iog L. n n 

66^ i-1 ^ i»l ^ 

and the first derivative of the log of the likelihood given in (13) is 
given by * 

Slog L N 

^= ^ \. (32) 

6d^ j=l 

for the difficulty parameter^ and 

61og L N N 

— = ^ - ^ (33) 

for the discrimination parameter. 

The estimation of ability using maximum likelihood techniques simply 
involves setting (31) equal to zero and solving for 6^. Of course, since 

this involves solving simultaneous nonlinear equations^ some type of 
iterative procedure is generally required* The estimation of item parameters 
involves setting (32) and (33) equal to zero and solving for and a., 
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Figure 3 



An Item Information Surface 
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Figure 5 



An Item Information Surface 
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Figure 7 
A Test Information Surface 
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respeeLively- Again, tiie solution of simultaneous nonlinear equations 
requires an iterative procedure. HcKinley and Reckase (1983) describe a 
procedure for the simultaneous estiniation of the item and person parameters 
of the M2PL model using a Newton-Raphson procedure for solving the 
simultaneous nonlinear equations, A computer program is available. 



Discussion 

Although IRT has gained popularity over the last few years, applications 
of IRT models liave been limited to tests for which the assumption of 
uni^limensionalily is at least defensible. There have been a few IRT models 
proposed for use with multidimensional data (see HcKinley and Reckase, 1982, 
for a suininary), but there have been few successful attempts at their 
application. Use of these models has been limited due to the absence of 
practical algorithms for parameter estimation, and, at least in part, because 
the models are not well understood, 

HcKinley and Reckase (1982) have proposed a model, the H2PL model> for 
use with multidimensional data, and they have developed a program for the 
estimation of the parameters of the model (HcKinley and Reckase, 1983), 
The purpose of this report is to provide information necessary for the 
understanding and use of the M2PL model. 

Many of the characteristics of the M2PL model are not straightforward 
extension,s from the uridimensional case. Rather, the unidimensional case 
is a special case of the multidimensional model in which much of the richness 
and complexity o£ the model is not evident. Because of this, some of the 
characteristics of the model described in this report may be somewhat 
difficult to grasp. In order to aid in the understanding of these character- 
istics, they will now be discussed in some depth. An attempt will be made 
in each case to describe how the information provided relates to real-world 
applications. The discussion will begin with the interpretation of the 
model parameters, and will include the sufficient statistics, information 
functions, and parameter estimation. Before beginning the discussion of 
the characteristics of the H2PL model^ however, a brief discussion of 
directional derivativei> will be presented, since directional derivatives 
are so important to the understanding of multidimensional IRT models. 



Directional Derivatives 

One of the most interesting and complex aspects of the multidimensional 
IRT models which is lost when the unidimensional case is discussed is the 
notion of directional derivatives. In the unidimensional case the only 
direction ever discussed is parallel to the 6-axis = 0**), in which case 
all the trigonometric terms so evident in the material presented above are 
absent — they always equal 1,0 or 0,0 and therefore drop out of the equations. 

Directional derivatives are necessary in the multidimensional case 
because the derivatives of the response function vary depending on the 
direction taken relative to tr.e 6-axes. The first derivative of a function 
gives the slope of the function at any given point. The second derivative 
gives the rate at which the slope is changing at a particular point. The 
point of maximum slope is where the slope stop,s increasing and starts 
decreasing. At the point where that change occurs, the change in slope 
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crosses from bciRg positive (increasing) to negative (decreasing). ThuSj 
at that point the change in slope is neither positive nor negatwe^ but 
rather is zero. Since the second derivative gives the rate of change of 
slope, the point of maximum slope is where the second derivative is zero. 
In the unidimenslonal case, this has a straightforward application in 
determining item difficulty and discrimination, as illustrated in Figure 1. 
The point where the dotted line crosses the ICC is the point of maximum f^lope 
and minimum (zero) change in slope. 

In Figure 2, it can easily be seen that there is no one point on the 
surface where the slope is at a maximum^ Moreover, for any one point, the 
slope varies depending on the direction. Consider the point on the surface 
where 9i = 0.0 and 62 = *2.5. This point is indicated on the surface by an 
X. Moving along the B2 = -2.5 line parallel to the 61 axis, the surface is 
rising fairly rapidly at the point indicated. However, moving along the 

61 = 0 line parallel to the 62"^^^^^, the surface is still relatively flat 
and is rising slowly. Clearly the slope of the surface is different 
depending on the direction of travel. The same is true of the change in 
slope. Because of this, when taking derivatives of a multidimensional 
response function, it is necessary to consider the direction. Directional 
derivatives are a way of doing this. The actual interpretation of the 
derivatives in different directions will be addressed in the next section, 
since it is closely related to the interpretation of the model parameters. 

Interpretation of the Hodel 

A straightiorward extension of item difficulty from the unidimensional 
to the multidimensional case would seem to lead to the conclusion that 
difficulty in the multidimensional case ought to be a vector of b-parameters , 
with one b for each dimension. In Figure 1 the b-paramete?- is the point on 
the e-scale below the point of inflection. It represents the point on the 
6-scale t>rhere the item best discriminate.s between high and low ability. At 
the point represented by the b-parameter, a very small change in ability 
corresponds to a large change in the probability of a correct response. 
Nowhere on the 6-scale does an equal change in ability result in as large a 
change in the probability of a correct response. Thus, in the unidimensional 
case, the item difficulty parameter indicates the point on the ability scale 
at which the item does the best job of discriminating between different 
levels of ability. 

On the surface, it seems logical to conclude that if there are two 
dimensions, there should be two b-parameters. One b*parameter should 
indicate the point of maximum discrimination on one dimension, while the 
other b-parameters indicate the point of maximum discrimination on the 
other dimension. Figure 2, however, clearly illustrates that tliis is 
inadequate . 

As can be seen in Figure 2^ the two ability dimension.s do not act 
independently. It is the combination of ability on the two dimensions 
which determines the probability of a correct response. An examinee with 

62 - 2.0 clearly has a higher ability on that dimension than an examinee 
with 62 = -2.0. However, if the second examinee has 0^ 3.0, while the 
first examinee has Bi = -3.0, the second examinee has a much higher 
probability of a correct response to the item described by the IRS. Clearly, 
then, considering the two dimensions separately does not contribute greatly 
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to discriminating between examinees who have different probabilities of a 
correct response. This is reflected in the fact that the item difficulty 
for Figure 2 is a line which is not parallel to cither axis. 

This has important implications for test construction and analysis 
using the M2PL model- It is conunon practice, for instance, to order items 
on a test by difficulty, or to construct a test having a specified 
distribution of item difficulty. In the unidimensional case this is done 
using the b-parameter. Clearly in the multidimensional case it is more 
complicated. An item having a smaller d-parameter than a second item is only 
uniformly more difficult than the second item if their difficulty functions 
are parallel. If the difficulty functions intersect, then item 1 is more 
difficult than item 2 in some regions of the S-plane, while item 2 is more 
difficult in other regions. 

This would seem to indicate that it is only reasonable, in the multi- 
dimensional case, to talk about ordering items on difficulty or obtaining a 
specified distribution of difficulty if all the items to be considered have 
parallel lines of difficulty^ Of course, in the m^dimensional case the 
items would all have to have parallel (m-^1 )^dimensioiial hyperplanes. 

In order to determine whether two items have parallel lines of difficulty 
in the two-dimensional case, first determine the form of the difficulty line. 
The equation for the line of difficulty is given by (8), The two lines are 
parallel only if the slopes of the lines are equal. Putting (8) into a 
slope*intercept form yields 

= - — e,, + — > (34) 

where a^^ is the item discrimination parameter for item i for dimension 1, 

a^2 dimension 2, 9^^ is the ability parameter for examinee j for 

dimension 1, and ability parameter for dimension 2. If item 2 is 

denoted by a prime C^), then the lines of difficulty for items 1 and 2 are 
parallel only if 

■^■-^ ■ (35) 

\2 ^12 



If all items meet the condition set out in (35), then they can be ordered 
by difficulty, by simply ordering them by their d-parameters. 

The ordering of items on difficulty implies that there is some underlying 
variable being measured that has some correspondence to the criterion used 
for the ordering. In this case there is some composite of the 9s which 
corresponds to the difficulty continuum formed by the items having parallel 
lines of difficulty. The composite is determined by the orientation of the 
lines of difficulty. 

The extension of item discrimination to the multidimensional case is 
even more complex than the extension of item difficulty. Unlike difficulty, 
the concept of item discrimination in the multidimensional case includes a 
consideration of direction — the angles indicating the direction do not fall 
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out of tue equations. Although the slope of the IRS shown in Figure 2 is 
constant all along the the line of difficulty for a given direction, it 
varies with different uirections. 

The need to consider direction has important implications for both 
test construction and test analysis. It is not enough in test construction, 
for instance, to merely select the item with the highest discrimination 
from aiTiong the available items. One item is uniformly more discriminating 
than a second item only if the slope of its IRS at the points of inflection 
is greater than the slope of the IRS for the second item for all directions. 
If item 1 has the higher a*value on one dimension, hut a lower a'value on 
another dimension, there may be directions for which the slope of the IRS at 
the points of inflection Will be greater for item 2. For example, consider 
the case where item 1 has discrimination parameters a = (t.O^ 0.5) and item 
2 has discrimination parameters a - (0.5^ 1.0). When ^ in (10) is 30 degrees^ 
the slope of the IRS at the points of inflection is 0.279 for item 1 and 0.233 
for item 2. When ^ is 60 degrees, the slope for item 1 is 0.233, while the 
slope for item 2 is 0*279. At (t> = 45 degrees, both items have a slope of 
0.26!) at the points of inflection. 

It seems to follow from the above discussion and example that, in 
interpreting item discrimination in the multidimensional case, the particular 
composite of abilities of interest must be considered. The composite might 
be specified a priori, as in test construction, or discovered by post 
administration analyses. 



Sufficient Statistics 

The notion of a sufficient statistic is not a sample one to grasp. 
Essentially, a statistic t is a sufficient statistic for the parameter 0 if 
it contains all the information in the sample data about 0. For example, 
the number-correct score for an item provides all the information in the 
response data about the d^-parameter. For the a-parameter for a particular 
dimension, a sufficient statistic is provided by a weighted sum of the item 
responses. The response of each examinee to the item of interest is weighted 
by the ex;jminee's ability on the dimension of interest* Thus, a correct 
response to an item by an examinee of high ability (6 > 0.0) adds to the 
value of the statistic, while a correct response by an examinee of low 
ability (0 < 0.0) decreases the value. 

For ability, 3 sufficient statistic is provided by a weighted sum of 
an examinee^s responses to all the items. The weighting factor is the 
discrimination parameter for the dimension of interest. Thus, a correct 
response to a highly discriminating Item adds more to the statistic than a 
correct response to an item of low discriminating power. 

Although the availability of sufficient statistics for the parameters 
of the M2PL model is important from an estimation standpoint, it should be 
pointed out that> with the exception of the d-parameter> the sufficient 
statistics described above are not observable. While the number-correct 
score of an item can be observed, the o-parameter of an item, and thus the 
Sum of item responses weighted by discrimination parameters, is not observohle. 
This complicates estimation somewhat by requiring that provisional estimates 
of some parameters be provided during tlie estlntation of the remaining 
parameters. Solutions, then, are obtoined by a series of approximotions by 
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varying from one step to the next which parameters are estimated. In each 
step the provisional estimates for the parameters not being estimated are 
the most recent estimates of those parameters. 



Information Function 

Item information in the multidimensional case is like item discrimina^ 
tion in that the information yielded by an item for a particular 6 varies 
with the direction of travel. This has important implications for such 
applications as adaptive testing, in which items may be selected for 
administration on the basis of item information. As was the case with item 
discrimination, the interpretation and use of item information requires the 
consideration of the particular composite of abilities which is of interest. 

Maximum Likelihood Estimation 

Estimation of the parameters of the N2PL model is surprisingly straight- 
forward- j.mplemenLation of the procedure described earlier in this report 
is not particularly difficult. However, it is rather expensive. 

One serious limitation of the procedure described is that there is no 
way to determine in advance how many dimensions should be included. The 
procedure is too expensive to allow successive runs for an increasing 
number of dimensions until a satisfactory solution is obtained. It is 
clear that, if the M2PL model is to be used, more work is needed in this 
area ^ 

More work also needs to be done to determine sample size requirements 
for estimation. Some giiidelines are needed for determining the maximum 
number of items and subjects required for good estimation. 



Item response theory has become an increasingly popular area for 
research and application in recent years. Areas where item response theory 
has been applied include test scoring (Wcodcock, 1974), criterion-referenced 
measurement (Hambleton, Swaminathan, Cook, Eignor, and Gifford, 1978), test 
equating (Marco, 1977; ^ientz and Bashaw, 1977), adaptive testing (McKinley 
and Reckase, 1980), and mastery testing (Patience and Reckase, 1978). 
While many of these applications have been successful, one unsolved problem 
is repeatedly encountered— most IRT models assume unidimensionality . As a 
result, applications of these models have been limited to areas iii which 
the tests used measure predominantly one factor (or can be sorted into 
subtests which measure predominantly one factor). When the assumption of 
unidimensionality is not met, most IRT models are inappropriate. 

The purpose of this report is to present an IRT model that does not 
require unidimensional tests. With such a model the great power of item 
response theory as a measurement tool can be applied for many of the pur- 
poses for which unidimensional models are employed, without the limitation 
on what kinds of tests are involved (i-e., the dimensionality of the tests). 
Of course, much more work is needed before the model can be employed in 
real testing situations. Procedures for the use of the model for different 
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applications must be worked out in greater detail^ and limitations on the 
practicality of the estimation procedures must be overcome. The information 
provided in this report provides a firm foundation for future work in these 
areas . 
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